翻訳と辞書
Words near each other
・ Russian Mountains (disambiguation)
・ Russian Multi-Purpose Salvage Vessels
・ Russian Museum
・ Russian Museum of Ethnography
・ Russian Museum of Military Medicine
・ Russian music
・ Russian Music Charts
・ Russian Music Competition
・ Russian Musical Society
・ Russian Muslims
・ Russian myths
・ Russian Narval-class submarine
・ Russian National Agency for Energy Saving and Renewable Energy
・ Russian National Autonomous Party
・ Russian National Badminton Championships
Russian National Corpus
・ Russian National Orchestra
・ Russian National Party
・ Russian National Public Library for Science and Technology
・ Russian National Research Medical University
・ Russian National Road Race Championships
・ Russian National Socialist Party
・ Russian National Time Trial Championships
・ Russian National Union
・ Russian National Unity
・ Russian National Wealth Fund
・ Russian nationalism
・ Russian Naval Aviation
・ Russian naval facility in Tartus
・ Russian Naval General Staff


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Russian National Corpus : ウィキペディア英語版
Russian National Corpus
The Russian National Corpus (English official name; the Russian name is Национальный корпус русского языка, lit. the National Corpus of the Russian language, but as the official English variant the Russian National Corpus is used) is a corpus of the Russian language that has been partially accessible through a query interface online since April 29, 2004. It is being created by the Institute of Russian language, Russian Academy of Sciences.
It currently contains about 350 million word forms that are automatically lemmatized and POS-/grammeme-tagged, i. e. all the possible morphological analyses for each orthographic form are ascribed to it. Lemmata, POS, grammatical items and their combinations are searchable. Additionally, 6 million word forms are in the subcorpus with manually resolved homonymy.
The subcorpus with resolved morphological homonymy is also automatically accentuated. The whole corpus has a searchable tagging concerning lexical semantics (LS), including morphosemantic POS subclasses (proper noun, reflexive pronoun etc.), LS characteristics proper (thematic class, causativity, evaluation), derivation (diminutive, adverb formed from adjective etc.).
The RNC includes also the following subcorpora:
*a treebank of syntactical dependencies (largely based on the Igor Mel'čuk's Meaning-Text Theory)
*English⇔Russian, German⇒Russian, Ukrainian⇔Russian and Belorussian⇔Russian parallel corpora;
*a large (100+ million words) separate corpus of modern newspapers (2001–2011);
*a corpus of Russian poetry, where the rhyming words and poetic prosody (including meter, stanzas etc.) is additionally tagged;
*a corpus of Russian dialects with specific dialect grammar tagging;
*a multimedia corpus with searchable tagged fragments of Russian-language movies;
*a corpus showing the history of Russian stress
*an educational subcorpus reflecting school standards.
All the texts have tags bearing metatextual information - the author, his/her birth date, creation date, text size, text genres (general fiction, detective story, newspaper article etc.); all these categories are browsable and searchable separately. It is possible to define a user's subcorpus to search lemmata/POS-grammeme/semantic tags combinations only within this subset.
==References==


抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Russian National Corpus」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.